Coflow Scheduling in Data Centers: Routing and Bandwidth Allocation
نویسندگان
چکیده
In distributed computing frameworks like MapReduce, Spark, and Dyrad, a coflow is set of flows transferring data between two stages job. The job cannot start its next stage unless all in the finish. To improve execution performance such job, it crucial to reduce completion time coflow, as can contribute more than 50 percent time. While several schedulers have been proposed, we observe that routing, factor greatly impacting Coflow Completion Time (CCT), has not well considered. this article, focus on scheduling problem jointly consider routing bandwidth allocation. We begin by providing an analytical solution optimal allocation with pre-determined routes. following, formulate single Non-linear Mixed Integer Programming present relaxed convex optimization problem. further propose algorithms, CoRBA simplified version: CoRBA-fast solve joint consideration Lastly, address multiple coflows online scheduling, scheduler named OnCoRBA. By comparing start-of-the-art algorithms via simulations, demonstrate CCT 30-400 OnCoRBA reduces average 20-230 percent. addition, be hundreds times faster around 8 degradation compared CoRBA, which makes use very appropriate practice.
منابع مشابه
Multi-hop Coflow Routing and Scheduling in Data Centers
Communication in data centers often involves many parallel flows that all share the same performance goal. A useful abstraction, coflow, is proposed to express the communication requirements of prevalent data parallel paradigms. The multiple coflow routing and scheduling problem faces challenges when deriving a good theoretical performance ratio because coexisting coflows will compete for the s...
متن کاملFair Virtual Bandwidth Allocation Model in Virtual Data Centers
Network virtualization opens a promising way to support diverse applications over a shared substrate by running multiple virtual networks. Current virtual data centers in cloud computing have flexible mechanisms to partition compute resources but provide little control over how tenants share the network. This paper proposes a utility-maximization model for bandwidth sharing between virtual netw...
متن کاملBandwidth Allocation and Session Scheduling using SIP
Session Initiation Protocol (SIP) is a new signaling protocol designed to establish multimedia sessions in telecommunication networks. In this paper, we suggest the extension of SIP functionalities to coordinate QoS mechanisms deployed in IP networks, and especially in DiffServ domain. Indeed, the interaction between small and big TCP sessions may have dramatic consequences on small TCP session...
متن کاملIntelligent Routing and Bandwidth Allocation in Wireless Networks
Sensor networks and satellite constellations face a number of challenges for reliable and robust communications. Increasingly heterogeneous nodes and a multitude of new emerging Earth science applications put additional restrictions on throughput and delay requirements. These problems are further aggravated by energy and bandwidth constraints on the network nodes. Quality of service and perform...
متن کاملExperimental Analysis of Algorithms for Coflow Scheduling
Modern data centers face new scheduling challenges in optimizing job-level performance objectives, where a significant challenge is the scheduling of highly parallel data flows with a common performance goal (e.g., the shuffle operations in MapReduce applications). Chowdhury and Stoica [6] introduced the coflow abstraction to capture these parallel communication patterns, and Chowdhury et al. [...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Parallel and Distributed Systems
سال: 2021
ISSN: ['1045-9219', '1558-2183', '2161-9883']
DOI: https://doi.org/10.1109/tpds.2021.3068424